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Abstract. A generalized word in two positive definite matrices A and B is 
a finite product of nonzero real powers of A and B. Symmetric words in 
positive definite A and B are positive definite, and so for fixed B, we can 
view a symmetric word, S{A,B), as a map from the set of positive definite 
matrices into itself. Given positive definite P, B, and a symmetric word, 
S{A,B), with positive powers of A, we define a symmetric word equation as 
an equation of the form S{A,B) = P. Such an equation is solvable if there is 
always a positive definite solution A for any given B and P. We prove that all 
symmetric word equations are solvable. Applications of this fact, methods for 
solution, questions about unique solvability (injectivity), and generalizations 
are also discussed. 



1. Introduction 

A generalized word {g-word, for short) W — W{A, B) in two letters A and B is 
an expression of the form 

W = A^^B^'^A^'^ B'^'^ ■ ■ . AP'' B'"' A''''+^ 

in which the exponents pi and qi are real numbers such that pi, qi ^ 0, i = 1, . . . , 
and Pk+i is an arbitrary real number. We call k the class number of W. The 
reversal of the g-word W is W* = AP"*^ B'^" AP" ■ ■ ■ B^^ Ap^ B'^^ Ap^ and a g-word is 
symmetric if it is identical to its reversal (in other contexts, the name "palindromic" 
is also used). We will call a g-word, W, A-positive (A-negative) if all exponents of 
A in are positive (negative). 

We are interested in the matrices that result when the two letters are (indepen- 
dent) positive definite (complex Hermitian) n-by-n matrices (PD, for short). For 
convenience, the letters A, B will also represent the substituted PD matrices (the 
context will make the distinction clear). To make sure that W is well-defined after 
substitution, we take primary PD powers (see H p. 433] and |, p. 413]). I.e. 
given p £ K\{0}, a unitary matrix U, and a nonnegative diagonal matrix D, we 
have {UDU* f = UDPU*. 

Our primary interest will be matrix equations involving A-positive symmetric 
g-words. 
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Definition 1.1. A symmetric word equation is an equation, S{A, B) = P, in which 
S{A, B) is an A-positive symmetric g-word. If B and P are given positive definite 
matrices, any positive definite matrix A for which the equation holds is called a 
solution to the symmetric word equation. 

A symmetric word equation will be called solvable if there exists a solution for 
every pair of positive definite n-hy-n B,P. Moreover, if each such pair gives rise to 
a unique solution, the equation will be called uniquely solvable. It is clear that the 
(unique) solvability of A-positive word equations implies the same as for A-negative 
equations (by replacing A with A^^), and hence, no generality is lost in Definition 
1.1. As we shall soon see, the restriction of the definition above to ^-positive words 
is important. 

We first encountered symmetric word equations while studying a trace conjecture 
|4j in the case of words with positive integral powers. The conjecture is 

Conjecture 1.2. A word has positive trace for every pair of real positive definite 
matrices if and only if the word is symmetric or a product (juxtaposition) of 2 
symmetric words. 

It turns out that for each solvable symmetric word equation, we can identify an 
infinite class of words that admit real PD matrices A and B giving those words 
a negative trace. Our investigation of these equations, although useful for our 
methods in Q], show that they have a rich structure of their own, some of which 
we explore here. (See Section 7 for a generalization of the notion of "symmetric 
word equation" defined above). A natural question to ask is if it is necessary 
to focus attention on symmetric g-words. We remark that it can be shown that 
symmetric g-words are the only g-words that are positive definite for all positive 
definite matrices A, B. In light of Definition 1.1, our restriction seems appropriate. 

2. Symmetric words 

Recall that two n-hy-n matrices X and Y are said to be congruent if there is 
an invertible n-hy-n matrix Z such that Y = Z*XZ] and that congruence on 
Hermitian matrices preserves inertia (the ordered triple consisting of the number of 
positive, negative, and zero eigenvalues) and, thus, positive definiteness ||, p. 223]. 
A symmetric word of class k in two positive definite matrices is congruent to one of 
class fc — 1, iteration of which implies congruence to the "center," class 0, positive 
definite matrix. We conclude that 

Lemma 2.1. A symmetric g-word in two positive definite matrices is positive def- 
inite. 

A natural question to ask is if the map from the set of PD matrices to itself given 
hy h : A^ S{A^ B) is surjective. Our main result answers this in the affirmative. 

Theorem 2.2. Every symmetric word equation is solvable. 

We remark that the statement is not generally valid if the definition of "sym- 
metric word equation" is widened to allow mixed sign powers of A^ even in the case 
n > 1. Let / denote the n-by-n identity matrix and let B ^ I be an n-by-n PD 
matrix. Then, the equation, A~^BA^BA~^ = /, has no PD solution. For if there 
were a PD solution A, then ABA^^ — U, for some unitary U. Hence, B — A^^UA 
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has eigenvalues on the unit disc. The only positive definite matrix for which this is 
so is i? = /. 

We believe that h is also injective, but this is proven only in special cases. This 

is 

Conjecture 2.3. Every symmetric word equation is uniquely solvable. 

3. Some specific equations 

In what follows, we shall say that two symmetric word equations are (uniquely) 
equivalent if for each, its (unique) solvability implies the (unique) solvability of the 
other. For instance, the equations ABA^BA — P and ABA — P are uniquely 
equivalent because PD matrices have unique PD square roots. As a more subtle 
example, the equations A^BABA^ = P and ABA'^BA = P are seen to be uniquely 
equivalent by setting A = B^^/'^XB-^/'^ in A'^BABA'^ = P. Additionally, the 
same substitution gives us that for any integer fc > 2, A^{BA)^^^BA'^ = P and 
ABA^BA — P are uniquely equivalent. 

It is an easy exercise that the facts below follow from our definition of unique 
equivalence: 

(1) For r > and s ^ 0, the equations S{A'',B') = P and S{A,B) = P are 
uniquely equivalent. 

(2) For each positive integer k, S{A^ B)^ ~ P and S{A, B) = P are uniquely 
equivalent. 

(3) For s real, the equations B''S{A, B)B'' = P and S{A, B) = P are uniquely 
equivalent. 

The simplest examples of symmetric word equations are {A'^ — P, q & M\{0}}. 
These satisfy Conjecture 2.3 by uniqueness of PD g*'* roots. More interesting is the 
first non-trivial equation, ABA = P. This equation has arisen in other contexts 
We indicate some aspects of this equation useful to us. 

Theorem 3.1. The equation, ABA = P, has a unique solution for each pair of 
PD matrices B, P. Moreover, the unique PD matrix A is given by 

A = B-'/^iB'/^PB'/y^^B-'/\ 

Proof. Assume that P and B are given positive definite matrices and ^ is a PD 
solution to ABA = P. Set X = B^^^AB^^^, which is PD by Lemma 2.1, so that 
A = Then, 

P = p-^^'^XB-^/'^BB-^/'^XB-^^'^ = S^i/^AT^B^i/^ 

Therefore, X^ — B^/^ PB^/^ , from which it follows that X is uniquely determined 
as {B^I'^PB^/^fl'^. Hence, A must be B~^/^{B^I'^PB^/^fl'^B-^/^. Finally, sub- 
stituting this positive definite A (by Lemma 2.1 again) into the original equation 
does verify that it is a solution. □ 

As in 10] , given two PD matrices C and D we will denote Ci^D as the PD matrix, 

C#D = Ci/2(C-i/2i?C-i/2)i/2ci/2^ 

the so-called geometric mean of C and D. Notice that from Theorem 3.1, ABA = P 
has the unique solution A = {B~^)^P. Writing this equation (by inverting) as 
B~^ = AP~^A, we also have that A = P^{B~^), and from this comes the not so 
obvious fact that C#D = D#C. 
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4. Fixed points and a sequence lemma 

For more complex symmetric word equations, it is not clear that there should 
be an explicit formula for a solution, as in Theorem 3.1. We do not know one, for 
example, for A^BABA^ = P. Since our proof of the solvability of this equation 
(and all others) will use fixed-point theory, we record a useful theorem of Brouwer 
i 

Theorem 4.1 (Brouwer's fixed point theorem). If M is a compact, convex subset 
of a finite dimensional Banach space and if f : M ^ M is a continuous function, 
then there is a fixed point, p, for f in M . 

We will be using the spectral matrix norm throughout (see p. 295]). This 
norm is useful because for positive semidefinite A, it is just the largest eigenvalue of 
A. Brouwer's result is an important ingredient in the proof of Theorem 2.2. Before 
proving Theorem 2.2, we record the following. 

Lemma 4.2. Suppose {Ak} is a convergent sequence of positive definite matrices. 
Then, there is a subsequence {■^kj}j_^ such that 



lim 



{Ak^+i/k,y 



{Ak,+I/k,)-' 



exists. 



Proof. Set Ak = UkDkU^ for unitary Uk and = diag(Aife, \2k, . . . , A„fc) in which 
< Aife < X2k < ■ • • < A„fc are the eigenvalues of Ak. For each m G {!,... ,n}, 
define new sequences, 

, Aifc + 1/fc 

Amk + 1/fc 

Notice that hi{k) = 1 for all k. We now show by induction that there exists 
a subsequence, {kj}"^^, such that each of {hjn{kj)}°^-^ converges. To simplify 
matters later, we first assume that {Uk} converges to a unitary U by passing to a 
subsequence (the set of unitary matrices is compact). 

Let M G {1, . . . , n} be such that there exists a subsequence {kj}°°^^ making each 
of {^i(fcj)}°liv • ■i{'iM(fcj)}°li converge. Such an M clearly exists (e.g. M — 1). 
If M = n, then there is nothing to prove. Otherwise, examine the inequality, 

Xik + l/fci 

o<hM+i{k,)= 7 <1, 

which holds for all j. Since this is a bounded sequence, choose a subsequence of 
the sequence, {kj}J^^, making this ratio converge to some nonnegative number. 
This will not alter the convergence of the first M sequences. This completes the 
induction. 

We conclude that there exists a subsequence, {kj}J^-^, such that 

Xik- + Ilk, 
lim = km > 0. 



1/fc, 
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We now claim that lim 



exists. But the matrix in question is just 



/ ^ikj +i/fc, 

Aife,.+l//cj 



v 







which by construction has the hmit, 

/I 
U : 

This proves the lemma. 



\ 



u: 



u* 



□ 



5. All symmetric word equations are solvable 
We may now prove the main result. 

Proof of Theorem 2.2. Notice (using (1), (3) above) that it is equivalent to study 
symmetric word equations of the form AS{A^ B)A = P, where S{A, B) is a sym- 
metric g-word beginning and ending with a power of A. For instance, the equation 
A^/'^BABA^I'^ = P is (uniquely) equivalent to A{ABA'^BA)A = P. The convex, 
compact set to which we apply Brouwer's theorem is 

M = {A I A is positive semidefinite and ||A|| < 1} 

Define Sa to denote the sum of all the powers of A in S{A,B). Also, let S-b 
{Sb) denote the sum of all negative (positive) powers of B in S{A, B). Let fc be a 
positive integer. Define a function ft on the set M as follows: 



P 



fkiX) = 



#{s{X + I/k,B)-^) 
9k{X) 

pl/2 (p-l/2^(^ + J/fc^ S)-lp-l/2) pl/2 



where ■ M - 
9k{X) 



^+ is defined by, 

-l||l/2 



IPII P- 



B\\ 



9k{X) 



3/2 ||^_i||Sb/2 



{x + i/ky 



Sa/2 



From the properties of the spectral matrix norm and the fact that S{C, D) ^ 
S{C-^,D-^), it follows that fk is bounded by 1. I.e., 



IPII P- 



IPI 



5/2 



B- 



\\fk{X)\ 



< 



{x + i/ky 



Sa/2 



9k{X) 

We should note that in the 1-by-l case, we simply have fk{X) = 1, so that the 
unique fixed point for fk{X) is given by X = 1. More generally, from the properties 
of the geometric mean and since fc > 0, it is also clear that fk{X) is positive definite. 
Hence, fk{X) € M. From the discussion of primary matrix functions ||, p. 433] 
for normal matrices, it is seen that for g 7^ 0, A is a continuous function on M. 
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Therefore, fk is also continuous on M (it is made up of compositions and products 
of continuous functions). 

Now, apply Brouwer's fixed point theorem to give us = fk{A.k). Because fk 
is always positive definite, we must have that Ai- is nonsingular. Hence, from the 
properties of the geometric mean (Theorem 3.1), we have 

(5.1) gk{AkfAkS{Ak + I/k, B)Ak = P. 

Because {Ak}^^i is an infinite, bounded sequence of PD matrices, there is a 
subsequence that converges to a positive semidefinite matrix T. We will therefore 
assume that {Ak} converges. If T is actually positive definite, then we have 

^2 4. A. , r/;. m A. -TS{T,B)T goo{Tf 



P= lim gkiAkyAkSiAk 

k — ^oo 



in which goo{T) 
easily seen that 



\\P\\ P- 



,1/2 



B 



-I/k,B) Ak 

|S_b/2 ||^_i 



Sb/2 



Then, it is 



A = T-g^{Tf/^^+^^ 



is our desired PD solution. 

Since (5.1) has no limit interpretation if T is singular, we now show that T is 
necessarily positive definite. For each fc, form the decomposition, Ak = UkDkU^, 



with unitary Uk and Dk = diag(Aife,A 



2fe. 



, Xnk), where < Aifc < A2fe < . . . < 



X„k are the eigenvalues of Ak- Now, suppose that {Ak} converges to a singular 
matrix T with < Ai < A2 < . . . < A„ being the eigenvalues of T. Since T 

is assumed to be singular, let m > 1 be such that {Ai, A2, . . . , A„i} are the zero 

2 II II TJ\\S-B \\q-1\iSb 



eigenvalues of T. Set c = ||P || 11-^ 
(following from inverting equation (5.1)) 

(Ak + I/k)- 



, and examine the equality 



S{{Ak + I/k)-\B-^) = cAkP-^Ak 



which is just 



(5.2) 



cAkP-^Ak. 



(Ak + I/k)-' 

Prom Lemma 4.2, there is a subsequence {Ak- }°^^ such that {Uk^ }'jLi converges 

(Ak^+i/kj) g-jj-jg^g fgj. g^ii 7- > 0. Moreover, 



to some unitary U and such that lim ,, , 

i^oo ||(A,^+///c,-) II 

this limit is equal to 

/I ... 0\ 
(5.3) [/:•._ : U* 

\o ■■■ K) 

for some /i2, /13, . . . , /i^ > 0. If /12, /is, • • • , /in are all nonzero, then the limit as 
J ^ 00 of 

' {Ak^+I/k,)-^ 



{Ak^ + I/k^ 



,B- 



is invertible, while the limit of cAk^P ^Akj is singular. Whence, ht = for some 
t e {2, 3, . . . , n}. Moreover, if s > t, then hg = as well; this coming from the fact 
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that 

by our ordering of the eigenvalues of Ak ■ So assume that t is the largest element of 
{1, 2, . . . , n} such that ht ^ 0. Then, our limit (5.3) looks like 

L^u(^^' U* = UE'V* 

in which Eh ~ diag(l, /i2, . . . , ht) is positive and E — diag(l, . . . , /it, 0, . . . , 0). 
Now, set B = U*B~^U . Then, the left hand side of (5.2) (within the subsequence 
above) converges to 

(5.4) S{UEU*,B-^) =UEP'Bi'EP^B'i^ ■■■B'^^EP^Bi^EPW*. 

We claim (5.4) has the form 

for some PD t-hy-t matrix H. Indeed, the center matrix in (5.4) is of the form E^ or 
E^B'^EP, each being the direct sum of a t-hy-t PD matrix and an (n — t)-hy-{n — t) 
zero matrix. We now induct on the form of (5.4). Assume that EP'^B'''^ ■ ■ ■ B'^'^EP'^ 
is a direct sum of a t-hy-t PD matrix Hd with a zero matrix. Form the partition, 



(5.5) B - 

\i'21 £'22 

in which Bn is a t-hy-t PD matrix, B22 is an (n — t)-hy-{n — t) PD matrix and 
B12 = (sec H, p. 472]). Then, 

EP^-^B'i^-^ (Hd 0\ ^ ^p,_, ( Bii B12A /ffd OW Bii B12 I ^p,_ 

U Uy ^£^21 £^22/ \ U Uy \^i321 £^22 

= EP-'-^ fBiiHdBii BiiHdBi2\ ^p^_, 
\B21HdB11 B21HC1B12J 



is of the desired form, completing the induction. 

Examine now the right hand side of (5.2), cAf^.P^^Ai^., which converges to 
cTP^^T. Since the left hand side limit by above has rank t > 0, T cannot have all 
its eigenvalues equal to 0. Whence, we can write this limit as cUDU*P~^UDU*, 

in which _D = and A is the positive diagonal matrix, diag(Am_(-i, . . . , A„). 

Setting these two expressions equal gives us 



0\^,p_,^fO 0\ [H 



OAy \{) K) \0 

Of course, this equality is impossible. We simply compare the two (n, n) entries 
of the left and right hand sides to arrive at a contradiction (one is positive, the 
other zero). Hence, T is invertible, completing the proof. □ 

We remark that if P and B are chosen to be real, then the proof above shows 
that the solution A may be chosen to be real as well. 
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6. Approximate solutions 

We now make some remarks about finding approximate solutions to symmetric 
word equations. As a first approacli one would hope that an iteration of the func- 
tion defined in the proof of Theorem 2.2 would give rise to approximate solutions. 
Unfortunately, experimentation shows this not to be the case. We detail another 
method that has been quite effective in practice at finding solutions (in the case of 
positive integral powers) and in verifying our conjecture of unique solvability. 

Given a symmetric word S{A, B) with positive integral exponents, start with 
an initial PD matrix, Aq, (usually /), and expand the expression, S{Aq + D,B). 
Consider the formal sum, S'{Aq + D,B), of the terms in this expansion with at 
most a single D. Now, solve the linear system 

S'{Ao+D,B) = P 

for the matrix D and set Ai ^ Ao + Z?. Repeating this process gives our algorithm. 
As a simple example, the repeated equations for ABA = P are given by 

A.^iBD + DBA.^i = P - 

A, = Ai^i+D. 

Curiously enough, there seems to be no guarantee in general that these Ai will 
be positive definite (or even Hermitian), nor is it clear that the linear system for 
D above will always have a solution. Nonetheless, experimentation has shown 
that these iterations always converge to the same PD solution regardless of initial 
starting point. 

7. A GENERALIZATION 

We close by noting a generalization of Theorem 2.2 to a larger class of ma- 
trix equations. If Ci,...,Cm is any list of m invertible n-by-n matrices and 
W{A; Ci, . . . , C„) = AP^CiAP^C2 ■ ■ ■ C^Ap^C^Ap^ is such that W{A; Ci, . . . , Cm)* 
= W{A; Ci, . . . , Cm), we call W a generalized symmetric word. It is again an ele- 
mentary exercise in congruence that, if A is PD and W is a generalized symmetric 
word, then W{A-,Ci, . . . ,Cm) is positive definite. For a given PD matrix P and 
invertible Ci , . . . , Cm we call 

W{A;Ci,...,Cm)^ P 

with each > 0, a generalized symmetric word equation in the PD variable A. 
Though we are not motivated by any particular application, generalized symmetric 
word equations are natural to consider on a theoretical level. Our proof of Theo- 
rem 2.2 goes over directly to generalized symmetric word equations, except that a 
further technical condition on the Ci 's is needed. That condition is that the leading 
principal minors of any unitary similarity of C~^ should be nonzero. Of course, 
this means that any principal minor of any unitary similarity of C^^ should be 
nonzero for i = 1, . . . , m. Using elementary facts about the field of values of an 
n-hy-n matrix C: 

F{C) = {x*Cx \x*x^l, xe C"}, 
see chapter 1 of the latter condition may easily be seen to be equivalent to the 
statement that ^ F{C). For purposes of this work, we call such a matrix com- 
pletely invertible. Of course, any power of a positive definite matrix B is completely 
invertible, and this is what is essential in the proof of Theorem 2.2. For generalized 
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symmetric word equations, the analysis of (5.4) remains valid because any partition 
as in (5.5) will have an invertible leading principle submatrix. We conclude 

Theorem 7.1. For completely invertible C\, . . . ,Cm, the generalized symmetric 
word equation 

W{A-Cu...,C^) = P 
is solvable for any positive definite P. 

We have no example showing that complete invertibility cannot be replaced by 
invertibility. 
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